Why Maximum Entropy? A Non-axiomatic Approach
نویسنده
چکیده
Ill-posed inverse problems of the form y = Xp where y is J-dimensional vector of a data, p is m-dimensional probability vector which can not be measured directly and matrix X of observable variables is a known J ×m matrix, J < m, are frequently solved by Shannon’s entropy maximization (MaxEnt, ME). Several axiomatizations were proposed (see for instance [1], [2], [3], [4], [5], [6], [7], [8], as well as [9] for a critique of some of them) to justify the MaxEnt method (also) in this context. The main aim of the presented work is two-fold: 1) to view the concept of complementarity of MaxEnt and Maximum Likelihood (ML) tasks introduced at [10] from a geometric perspective, and consequently 2) to provide an intuitive and non-axiomatic answer to the ’Why MaxEnt?’ question. INTRODUCTION The concept of complementarity of maximum entropy and maximum likelihood tasks, proposed at [10], is in this vignette interpreted from a geometrical point of view of vectors. Two key notions are introduced: collinearity and coherence. In addition to shaping the complementarity into an elegant form the collinearity/coherence concepts offer an elementary answer to the persistent ’Why MaxEnt?’ question. COLLINEARITY AND COHERENCE: ML AND MAXENT Definition 1. System of events {A1,A2, . . . ,Am} is equivalently described by its distribution p = [p1,p2, . . . ,pm], or by its potential u = [u1,u2, . . . ,um]. The relationship of equivalence is
منابع مشابه
Axiomatic Characterizations of Information Measures
Axiomatic characterizations of Shannon entropy, Kullback I-divergence, and some generalized information measures are surveyed. Three directions are treated: (A) Characterization of functions of probability distributions suitable as information measures. (B) Characterization of set functions on the subsets of {1, . . . , N} representable by joint entropies of components of an N -dimensional rand...
متن کاملMaximum Entropy and Maximum Probability
Sanov’s Theorem and the Conditional Limit Theorem (CoLT) are established for a multicolor Pólya Eggenberger urn sampling scheme, giving the Pólya divergence and the Pólya extension to the Maximum Relative Entropy (MaxEnt) method. Pólya MaxEnt includes the standard MaxEnt as a special case. The universality of standard MaxEnt advocated by an axiomatic approach to inference for inverse problems i...
متن کاملAxiomatic Definition of Entropy for Nonequilibrium States
In introductory courses and textbooks on elementary thermodynamics, entropy is often presented as a property defined only for equilibrium states, and its axiomatic definition is almost invariably given in terms of a heat to temperature ratio, the traditional Clausius definition. Teaching thermodynamics to undergraduate and graduate students from all over the globe, we have sensed a need for mor...
متن کاملMaximum Entropy method with non-linear moment constraints: challenges
Traditionally, the Method of (Shannon-Kullback’s) Relative Entropy Maximization (REM) is considered with linear moment constraints. In this work, the method is studied under frequency moment constraints which are non-linear in probabilities. The constraints challenge some justifications of REM since a) axiomatic systems are developed for classical linear moment constraints, b) the feasible set ...
متن کاملCombinatorial Information Theory: I. Philosophical Basis of Cross-Entropy and Entropy
This study critically analyses the information-theoretic, axiomatic and combinatorial philosophical bases of the entropy and cross-entropy concepts. The combinatorial basis is shown to be the most fundamental (most primitive) of these three bases, since it gives (i) a derivation for the Kullback-Leibler cross-entropy and Shannon entropy functions, as simplified forms of the multinomial distribu...
متن کامل